Search CORE

148 research outputs found

Using the online cross-entropy method to learn relational policies for playing different games

Author: Driessens Kurt
Pfahringer Bernhard
Sarjant Samuel
Smith Tony C.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

By defining a video-game environment as a collection of objects, relations, actions and rewards, the relational reinforcement learning algorithm presented in this paper generates and optimises a set of concise, human-readable relational rules for achieving maximal reward. Rule learning is achieved using a combination of incremental specialisation of rules and a modified online cross-entropy method, which dynamically adjusts the rate of learning as the agent progresses. The algorithm is tested on the Ms. Pac-Man and Mario environments, with results indicating the agent learns an effective policy for acting within each environment

Maastricht University Research Portal

Research Commons@Waikato

Workshop on Rich Representations for Reinforcement Learning:Held in conjunction with the 22nd International Conference on Machine Learning, August 7, 2005, Bonn, Germany

Author: Driessens Kurt
Fern Alan
van Otterlo Martijn
Publication venue: University of Bonn
Publication date: 01/01/2005
Field of study

University of Twente Research Information

Contextual Encoder-Decoder Network for Visual Saliency Prediction

Author: Driessens Kurt
Goebel Rainer
Kroner Alexander
Senden Mario
Publication venue: 'Elsevier BV'
Publication date: 18/02/2019
Field of study

Predicting salient regions in natural images requires the detection of objects that are present in a scene. To develop robust representations for this challenging task, high-level visual features at multiple spatial scales must be extracted and augmented with contextual information. However, existing models aimed at explaining human fixation maps do not incorporate such a mechanism explicitly. Here we propose an approach based on a convolutional neural network pre-trained on a large-scale image classification task. The architecture forms an encoder-decoder structure and includes a module with multiple convolutional layers at different dilation rates to capture multi-scale features in parallel. Moreover, we combine the resulting representations with global scene information for accurately predicting visual saliency. Our model achieves competitive and consistent results across multiple evaluation metrics on two public saliency benchmarks and we demonstrate the effectiveness of the suggested approach on five datasets and selected examples. Compared to state of the art approaches, the network is based on a lightweight image classification backbone and hence presents a suitable choice for applications with limited computational resources, such as (virtual) robotic systems, to estimate human fixations across complex natural scenes.Comment: Accepted Manuscrip

arXiv.org e-Print Archive

Maastricht University Research Portal

ZENODO

Integrating Guidance into Relational Reinforcement Learning

Author: Kurt Driessens
Sašo Džeroski
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

Author: Broeck Guy Van den
Driessens Kurt
Publication venue
Publication date: 01/09/2011
Field of study

Maastricht University Research Portal

Searching for ring-like structures in the Cosmic Microwave Background

Author: Bonizzi Pietro
Driessens Kurt
Koekoek Gideon
Lopez Melissa
Vries Jacco de
Westra Ronald
Publication venue
Publication date: 09/05/2021
Field of study

In this research we present a new methodology to search for ring-like structures in the CMB. The particular context of this work is to investigate the presence of possible observational effects associated with Conformal Cyclic Cosmology (CCC), known as Hawking points. Although our results are not conclusive due to the statistical disagreement between the CMB sky map and the simulated sky maps in accordance to

\Lambda CDM

, we are able to retrieve ring-like anomalies from an artificial data at

95 \%

confidence level. Once this discrepancy has been assessed, our method may be able to provide evidence of the presence or absence of Hawking points in the CMB. Hence, we stress the need to continue the theoretical and experimental research in this direction

Maastricht University Research Portal

Introduction to the special issue dedicated to the Journal Track of ECML PKDD 2017

Author: Dragi Kocev
Kurt Driessens
Marko Robnik-Šikonja
Myra Spiliopoulou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Learning Structural Kernels for Natural Language Processing

Author: Bergstra James
Cancedda Nicola
Chang Chih-Chung
Chu Wei
Driessens Kurt
Gönen Mehmet
Igel Christian
Lodhi Huma
Pang Bo
Pedregosa Fabian
Specia Lucia
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2015
Field of study

Structural kernels are a flexible learning paradigm that has been widely used in Natural Language Processing. However, the problem of model selection in kernel-based methods is usually overlooked. Previous approaches mostly rely on setting default values for kernel hyperparameters or using grid search, which is slow and coarse-grained. In contrast, Bayesian methods allow efficient model selection by maximizing the evidence on the training data through gradient-based methods. In this paper we show how to perform this in the context of structural kernels by using Gaussian Processes. Experimental results on tree kernels show that this procedure results in better prediction performance compared to hyperparameter optimization via grid search. The framework proposed in this paper can be adapted to other structures besides trees, e.g., strings and graphs, thereby extending the utility of kernel-based methods

arXiv.org e-Print Archive

Crossref

Publikationer från Uppsala Universitet

Edinburgh Research Explorer

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Spiral - Imperial College Digital Repository

White Rose Research Online

University of Melbourne Institutional Repository